On the Furthest Hyperplane Problem and Maximal Margin Clustering

نویسندگان

  • Edo Liberty
  • Shachar Lovett
  • Omri Weinstein
چکیده

This paper introduces the Furthest Hyperplane Problem (FHP), which is an unsupervised counterpart of Support Vector Machines. Given a set of n points in R, the objective is to produce the hyperplane (passing through the origin) which maximizes the separation margin, that is, the minimal distance between the hyperplane and any input point. To the best of our knowledge, this is the first paper achieving provable results regarding FHP. We provide both lower and upper bounds to this NP-hard problem. First, we give a simple randomized algorithm whose running time is n ) where θ is the optimal separation margin. We show that its exponential dependency on 1/θ is tight, up to sub-polynomial factors, assuming SAT cannot be solved in sub-exponential time. Next, we give an efficient approximation algorithm. For any α ∈ [0, 1], the algorithm produces a hyperplane whose distance from at least 1 − 5α fraction of the points is at least α times the optimal separation margin. Finally, we show that FHP does not admit a PTAS by presenting a gap preserving reduction from a particular version of the PCP theorem. ∗Yahoo! Research [email protected]. †Yahoo! Research, [email protected]. ‡IAS, [email protected]. Supported by DMS-0835373. §Technion Institute of Technology and Yahoo! Research [email protected]. ¶Princeton University and Yahoo! Research [email protected].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised SVMs: On the Complexity of the Furthest Hyperplane Problem

This paper introduces the Furthest Hyperplane Problem (FHP), which is an unsupervised counterpart of Support Vector Machines. Given a set of n points in R, the objective is to produce the hyperplane (passing through the origin) which maximizes the separation margin, that is, the minimal distance between the hyperplane and any input point. To the best of our knowledge, this is the first paper ac...

متن کامل

Bayes Optimal Hyperplanes! Maximal Margin Hyperplanes

Maximal margin classifiers are a core technology in modern machine learning. They have strong theoretical justifications and have shown empirical successes. We provide an alternative justification for maximal margin hyperplane classifiers by relating them to Bayes optimal classifiers that use Parzen windows estimations with Gaussian kernels. For any value of the smoothing parameter (the width o...

متن کامل

Indoor WLAN positioning Using Hybrid SVM Hyperplane Margin Clustering and Regression

This paper proposes a novel indoor Wireless Local Area Network (WLAN) positioning algorithm by using the Support Vector Machine (SVM) Hyperplane Margin Clustering and Regression (SVMCR). First of all, we rely on the SVM Hyperplane Margin Clustering (SVMC) to reduce the search space of the fingerprint database. Second, we use the Support Vector Regression (SVR) to characterize the relations of t...

متن کامل

Efficient Maximum Margin Clustering via Cutting Plane Algorithm

Maximum margin clustering (MMC) is a recently proposed clustering method, which extends the theory of support vector machine to the unsupervised scenario and aims at finding the maximum margin hyperplane which separates the data from different classes. Traditionally, MMC is formulated as a non-convex integer programming problem and is thus difficult to solve. Several methods have been proposed ...

متن کامل

Minimum Spectral Connectivity Projection Pursuit for Unsupervised Classification

We study the problem of determining the optimal univariate subspace for maximising the separability of a binary partition of unlabeled data, as measured by spectral graph theory. This is achieved by finding projections which minimise the second eigenvalue of the Laplacian matrices of the projected data, which corresponds to a non-convex, non-smooth optimisation problem. We show that the optimal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1107.1358  شماره 

صفحات  -

تاریخ انتشار 2011